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Abstract 

This  final  report  presents  a  summary  of  research  accompli  shed  v  for  the  AFOSR  under 
grant  number  AFOSR  82-0303  for  the  period  November  1986  -  February  1988Tto  investigate 
parallel  problem  solving.  Under  the  current  grant  a  parallel  problem  solving  system,  PRISM 
(Parallel  Inference  System),  that  was  implemented  on  the  VAX/11-780,  the  PYRAMID  and 
SUN  machines,  was  ported  successfully  to  McMOB  and  then  to  the  BBN  Butterfly  parallel 
architecture.  The  McMOB  architecture  is  essentially  the  ZMOB  architecture  with  16  Motorola 
68000  processors,  upgrading  the  Z80A  microprocessors,  interconnected  in  a  ring  structure. 

Experimental  testing  of  PRISM  on  McMOB  was  undertaken  in  the  current  year.  In  addi¬ 
tion,  several  enhancements  were  made  to  PRISM  to  permit  experimental  analyses  to  be  made, 
and  to  incorporate  additional  features  to  take  full  advantage  of  parallelism  in  a  problem  solving 
environment  The  tracing  and  statistical  gathering  packages  were  extended.  An  ability  to 
display  AND- parallelism  was  added  to  the  trace  program  which  displays  the  execution  of  a 
program  on  the  parallel  machines. 

In  addition  to  the  above,  work  continued  in  the  area  of  informative  answers  to  be 
presented  to  a  user.  Heuristic  techniques  were  developed  to  determine  which  information  to 
display.  v.  ■  ^  Grw-. 1  ><r££,*^-c  •  f  C- 

The  system  software  for  ZMQB/McMOB  is  ndw  robust  arid  considered  completed.  This 
has  allowed  us  to  reemphasize  our  studies  on  parallel  software.  A  new  formalism  for 
slicing/splicing  was  developed  which  eliminates  much  of  the  run-time  overhead  of  the  tech¬ 
nique,  allowing  for  the  development  of  a  splicing  compiler.  Work  has  also  focused  on  the 
development  of  debugging  tools  for  parallel  software  and  the  integration  of  artificial  intelli¬ 
gence  techniques  into  debugging  software. 


1.  Introduction 


Under  the  current  grant,  a  detailed  design  and  implementation  of  a  parallel  problem  solv¬ 
ing  system  based  on  logic,  PRISM  (Parallel  /nference  SysteAf),  was  implemented  on  the 
McMOB  parallel  processor  and  ported,  successfully  to  the  BBN  Butterfly  parallel  processor. 
The  McMOB  machine  has  the  identical  architecture  as  ZMOB.  It  differs  in  that  McMOB  has 
only  16  microprocessors  attached  to  the  belt  and  that  these  are  Motorola  68000  microproces¬ 
sors.  PRISM  was  previously  implemented  using  a  simulated  ZMOB  belt  on  VAX,  PYRAMID 
and  SUN  machines,  before  it  was  ported  to  McMOB.  PRISM  underwent  experimental  testing, 
was  enhanced  in  a  number  of  ways,  and  a  large  set  of  problems  was  tested  using  the  system. 


McMOB  was  also  made  operational  on  the  software  level.  The  operating  system 
software  for  McMOB  is  now  fully  functional  and  no  further  work  is  deemed  necessary  in  that 
area.  Further  research  was  performed  in  the  development  of  program  sheers  and  splicers  for 
automatic  parallelization. 


In  section  2  we  provide  a  description  of  the  accomplishments  under  the  current  grant.  In 
the  area  of  parallel  problem  solving,  the  initial  PRISM  has  been  fully  implemented  and  tested 
in  a  parallel  environment.  Extensive  experimentation  was  begun  on  evaluating  PRISM  on  a 
parallel  architecture.  An  approach  has  been  developed  to  utilize  heuristics  to  obtain  informative 
answers  for  queries  to  deductive  databases  and  problem  solving  systems,  in  the  area  of  paral¬ 
lel  systems  hardware  and  software,  the  McMOB  architecture  has  been  completed,  and  focus 
has  shifted  to  the  software  issues  involved.  A  new  formalism  has  been  developed  allowing  us 
to  exploit  regularities  in  the  structures  of  splicing,  drastically  reducing  the  run-time  overhead 
on  splicing  programs.  Further,  a  prototype  for  a  practical  automatic  debugging  aid,  using 
Artificial  Intelligence  techniques,  has  been  designed  and  implemented. 
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2.  Accomplishments  on  Effort  During  Period  November,  1986*  February,  1988 

This  section  is  subdivided  into  two  major  parts.  The  first  section,  2.1,  describes  the 
accomplished  research  with  respect  to  PRISM  -  the  parallel  problem  solving  system.  The 
second  section,  2.2,  describes  the  efforts  for  the  development  of  parallel  systems  software  and 
hardware  for  experimentation  with  parallel  algorithms. 

2.1.  PRISM  and  Parallel  Problem  Solving  on  McMOB  and  Butterfly 

There  were  seven  major  tasks  in  parallel  problem  solving  undertaken  under  the  current 
grant.  These  are: 

(a)  Implement  PRISM  on  McMOB 

(b)  Application  Studies 

(c)  Alternative  Machine  Configurations  -  Design  of  Combined  PSM/IDB  Machine 

(d)  Develop  New  Features  for  PRISM 

(e)  Alternative  Architectures 
(0  Analyze  Parallel  Algorithms 

(g)  Informative  Answers 

(h)  Control  Structure  Investigation 

2.1.1.  Implement  PRISM  on  McMOB 

We  have  successfully  implemented  PRISM  on  the  McMOB  parallel  machine.  The  work 
involved  implementing  a  communications  package  that  allows  PRISM  programs  to  communi¬ 
cate  over  the  McMOB  belt  All  other  programs  that  had  been  running  on  the  VAX,  SUN,  and 
PYRAMID  machines  were  compiled  for  the  McMOB  machines  and  ran  successfully.  This  work 
is  complete,  including  the  implementation  of  the  EDB  and  Host  processors.  A  paper  has  been 
published  in  the  Sigarch  Newsletter  that  describes  the  system.  [Giuliano,  Kohli  and  Minker] 

2.1.2.  Application  Studies 

A  series  of  application  studies  have  been  performed  using  PRISM  on  McMOB.  Two 
problem  sets  were  developed.  One  was  a  set  of  problems  known  in  the  literature,  and  the 
second  was  a  set  of  programs  generated  from  abstract  tree?.  The  following  results  have  been 


obtained:  speed  ups  are  generally  obtainable  when  more  processors  are  allocated  to  solve  a 
problem.  Splitting  the  intensional  database  component  and  the  problem  solving  component  of 
the  system  into  separate  machines  does  not  appear  to  be  useful.  Preliminary  indications  are 
that  simple  heuristics  can  be  used  to  allocate  problems  to  processors  to  achieve  runtime  speed 
up.  Additional  experimental  work  is  required  before  final  conclusions  can  be  made  on  this 
latter  observation.  A  report  has  been  written  that  describes  results  obtained  on  the  PRISM 
experiments  using  the  McMOB  and  Butterfly  processors  ( Experiments  with  Parallel  Logic  Pro¬ 
gramming  in  PRISM  [Giuliano,  et  al.] ) 

2.1.3.  Alternative  Machine  Configurations  •  Design  of  Combined  PSM/IDB  Machine 

A  design  for  a  combined  PSM/IDB  machine  was  developed  and  the  design  was  imple¬ 
mented  on  McMOB.  Due  to  the  studies  described  above,  we  believe  that  there  is  no  need  to 
have  separate  PSM  and  IDB  machines.  Because  of  this,  we  have  not  implemented  the  EDB 
communication  protocol  for  the  IDB. 

2.1.4.  Develop  New  Features  for  PRISM 

The  PRISM  trace  programs  were  modified  to  incorporate  new  features  of  the  system.  The 
trace  programs  have  now  been  modified  so  as  to  display  AND-parallel  execution  and  the  capa¬ 
bilities  of  the  Constraint  Solving  Machine  (CSM).  The  trace  can  run  either  off-line  or  on-line 
with  the  McMOB  system.  The  user  can  see  the  program  executing  in  alternative  machines  at 
the  same  time  and  can  observe  both  OR-paraliclism  and  AND-parallelism  as  it  is  executing  on 
the  McMOB.  The  addition  of  these  features  allows  us  to  use  the  trace  to  debug  programs  exe¬ 
cuting  on  a  parallel  machine. 

In  addition,  a  list  notation  and  a  Nil.  h  mode  were  added  to  the  PRISM  system.  The 
AND-parallel  and  OR-parallel  PSMs  hj\e  Nvn  combined  into  one  system  so  that  AND/OR- 
parallelism  can  be  executed  simultaneous  rvisM 


2.1.5.  Alternative  Architectures 


We  have  investigated  the  BBN  Butterfly  architecture  to  determine  if  it  is  possible  to 
transfer  PRISM  to  that  machine.  In  contrast  to  McMOB,  the  Butterfly  machine  is  a  shared 
memory  parallel  architecture.  We  effected  a  transfer  to  the  Butterfly  by  simulating  the  McMob 
belt  As  noted  above,  this  system  is  operational  on  the  Butterfly  and  experimental  results  have 
been  obtained  using  the  system.  We  have  also  developed  a  plan  to  utilize  the  shared  memory 
system  of  the  Butterfly  and  expect  to  have  such  a  system  implemented  in  the  coming  grant 

2.1.6.  Analyze  Parallel  Algorithms 

A  new  divide-and-conquer  algorithm,  called  Formula  Dissection  was  developed  for  pro- 
positional  satisfiability.  The  dissection  step  of  the  algorithm  is  implemented  by  bisecting  the 
underlying  graph  representation  of  the  formula.  The  running  time  of  Formula  Dissection  pro¬ 
vides  a  constructive  proof  that  Planar-3SAT,  an  NP-Complete  class  of  SAT  can  be  solved  in 
time  exp(sqrt(n))  as  opposed  to  exp(n).  The  Formula  Dissection  paradigm  can  also  be  applied 
to  solve  other  NP-Complete  problems  such  as  finding  a  Hamiltonian  Cycle  in  a  planar  graph 
and  the  Trihedral  Scene  Recognition  problem  in  computer  vision.  A  paper  on  this  work  was 
submitted  for  publication  in  a  journal.  [Kasif,  Reif  and  Sherlekar] 

2.1.7.  Informative  Answers 

Detailed  heuristics  have  been  developed  to  control  the  natural  language  output  from 
deductive  databases  to  provide  informative  answers  to  queries  posed  by  a  user.  A  paper  has 
been  published  that  describes  some  of  the  heuristics  that  have  been  developed.  [Gal  and 
Minker] 

Implementation  of  part  of  this  effort  has  been  accomplished.  A  technical  report,  [Lobo 
and  Minker],  has  been  written  and  accepted  for  publication  that  describes  a  meta-program  that 
integrates  integrity  constraints  into  a  deductive  database  to  semantically  constrain  the  search 
for  answers.  The  meta-program  also  has  the  ability  to  interface  with  a  relational  database. 


This  basic  capability  had  to  be  developed  prior  to  obtaining  the  full  capability  incorporating 
the  v/ork  in  the  previous  paragraph. 


2.1.8.  Control  Structure  Investigation 

A  Ph.D.  thesis  has  been  written  by  Madhur  Kohli  [Kohli]  that  describes  a  compiler  that 
permits  a  user  to  develop  an  interpreter  with  a  control  capability  specified  for  a  particular 
application.  The  compiler  has  been  written  and  experiments  have  been  conducted  with  the 
compiler  output.  An  interpreter  with  the  PROLOG  control  strategy  was  implemented  and  com¬ 
pared  with  PROLOG.  Tests  run  on  the  VAX  machine  indicate  that  the  compiled  control 
operates  approximately  half  as  fast  as  PROLOG.  However,  for  control  structures  that  have  to 
be  implemented  with  a  meta-interpreter  on  PROLOG,  the  compiled  interpreter  operates  approx¬ 
imately  ten  times  as  fast  as  the  meta-interpreter.  Hence,  the  approach  is  both  significant  and 
viable  for  obtaining  interpreters  with  control  strategies  different  than  that  incorporated  in  PRO¬ 
LOG.  The  results  obtained  are  described  in  Kohli 's  thesis  and  in  [Kohli  and  Minkcr], 

22.  Parallel  Hardware  and  Software 

There  were  four  major  tasks  in  parallel  hardware  and  software  undertaken  under  the 
current  grant  These  were: 

(a)  Theoretical  Slicing  and  Splicing 

(b)  Practical  Slicing  and  Splicing 

(c)  Automatic  Debugging  Methods 

(d)  New  Architectures 

2.2.1.  Theoretical  Slicing  and  Splicing 

A  major  advance  in  slicing/splicing  was  made  this  year  in  the  form  of  a  new  methodol¬ 
ogy  for  performing  the  splicing  process.  The  previous  method  of  preventing  jumbled  outputs, 
due  to  processor  asynchrony,  required  resequencing  the  outputs  of  different  slices  and  thus  each 
slice  was  required  to  continually  broadcast  certain  information  to  a  central  location.  This 


continuous  transmission  caused  a  prohibitive  expense  in  terms  of  overhead  for  run-time  execu¬ 
tion,  and  made  the  rapid  splicing  of  large  programs  practically  impossible. 

The  new  methodology  is  based  on  a  formalization  of  the  splicing  problem  utilizing  graph 
theory.  It  can  be  demonstrated  from  this  formalization  that  certain  constraints  between  outputs 
are  guaranteed  to  hold  and  thus,  instead  of  having  each  splice  continually  broadcast  synchroni¬ 
zation  information  we  need  send  only  a  small  amount  of  information  to  a  central  processor 
when  outputs  occur.  Tags  are  attached  to  the  output  string  which  can  be  deciphered  by  the 
coordinating  processor  and  outputs  can  be  guaranteed  to  be  correctly  sequenced.  These  tags 
are  a  list  of  integers  bounded  by  the  number  of  output  statements  in  a  slice.  We  have  designed 
an  algorithm  which  efficiently  generates  and  decodes  these  tags  at  run-time.  Using  the  graph 
theoretic  basis  of  this  technique  we  have  been  able  to  provide  a  proof  of  the  minimality  of  the 
average  lengths  of  tags. 

The  method  does  not  require  that  the  total  behavior  of  a  program  be  represented  in  its 
slices.  Nor  does  it  require  that  the  slices  of  a  program  be  generated  in  any  particular  manner 
or  be  implemented  in  any  particular  language.  It  only  requires  that  each  slice  of  a  program 
faithfully  reproduce  some  portion  of  that  program's  original  behavior.  The  major  advantage  of 
the  new  method  is  that  it  radically  reduces  the  overhead  associated  with  reconstruction  of  the 
output  [Badger]. 

2J.2.  Practical  slicing  and  spiking 

A  major  bottle  neck  in  the  development  of  practical  slicing/splicing  systems  has  been  the 
run-time  overhead  associated  with  providing  the  synchronization  information  for  the  resequenc¬ 
ing  of  output.  Our  efforts  to  produce  a  production  quality  compiler  were  stymied  by  this 
bottleneck,  and  work  shifted  to  the  theoretical  aspects  of  splicing.  As  reported  above,  the  new 
formalization  of  splicing  has  broken  through  that  problem,  and  work  is  now  proceeding  on  the 
development  of  the  slicing/splicing  compiler  for  large  programs. 


2.2.3.  Automatic  Debugging  Methods 


Under  the  current  grant,  techniques  have  been  explored  for  applying  diagnostic  reasoning 
to  program  debugging.  The  difficulty  of  debugging  parallel  systems  software  motivates  getting 
as  much  automatic  aid  as  possible.  Under  the  current  grant  we  have  focused  on  design  and 
implementation  of  a  prototype  system  which  uses  AI  techniques  to  examine  control  flow  infor¬ 
mation  and  the  values  of  a  variable  as  the  program  runs.  We  have  completed  a  prototype 
implementation  which  examines  several  pieces  of  LISP  code  and  have  shown  that  the  tech¬ 
nique  will  generalize  to  other  interesting  classes  of  bugs.  We  have  also  formalized  our  tech¬ 
nique  and  have  demonstrated  its  validity  on  a  wider  variety  of  bug  cases. 

Our  technique  examines  various  “aspects"  of  the  program  as  it  executes  good  cases,  and 
compares  those  aspects  with  the  same  program  running  bad  cases.  As  computation  progresses 
machine  learning  techniques  are  used  to  induce  “expectations"  of  these  aspects.  An  expecta¬ 
tion  is  simply  some  predicate  that  satisfies  the  observed  behavior.  We  then  attempt  to  match 
the  expectation  derived  from  the  good  cases  with  the  aspect  from  the  anomalous  case.  If  they 
fail  to  match,  examining  this  failure  provides  some  insight  into  the  nature  of  the  bug.  particu¬ 
larly  into  identifying  where  the  error  has  occurred.  [Mazurek] 

An  important  aspect  of  this  work  has  been  an  effort  to  make  the  debugging  techniques  as 
language  independent  as  possible.  While  the  learning  and  diagnostic  techniques  require  AI 
referencing  capabilities,  and  are  thus  best  implemented  in  an  AI  language  such  as  LISP,  the 
program  control-flow  analysis  does  not  require  that  the  data  be  a  LISP  program.  To  demon¬ 
strate  this  we  have  used  the  multiple  language  capabilities  of  the  Texas  Instruments  Explorer 
machine.  These  capabilities  permit  shared  memory  communications  between  UNIX  and  LISP 
processor  boards.  We  have  implemented  a  version  of  our  prototype  which  runs  on  this  architec¬ 
ture  permitting  the  LISP  processor  to  monitor  the  function  of  programs  running  on  the  UNIX 
board.  Using  this  technique  the  LISP-based  debugger  can  work  on  code  written  in  C  or  other 
procedural  languages.  A  demonstration  of  this  system  will  be  presented  at  this  year’s  Ameri¬ 
can  Association  of  Artificial  Intelligence  Conference  in  Seattle. 


2.2.4.  New  architectures 

The  major  accomplishment  of  this  grant  period  was  the  completion  of  the  McMOB  sys¬ 
tem.  The  system  has  now  been  tested  and  debugged  and  the  operating  system  is  completely 
functional.  The  system  is  now  in  use  by  departmental  personnel  and  support  of  the  machine  is 

p 

being  handled  by  the  department. 
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